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Abstract: Models of axion inflation are particularly interesting since they provide a natu- 
ral justification for the flatness of the potential over a super-Planckian distance, namely the 
approximate shift-symmetry of the inflaton. In addition, most of the observational conse- 
quences are directly related to this symmetry and hence are correlated. Large tensor modes 
can be accompanied by the observable effects of a the shift-symmetric coupling (pFF to a 
;-H gauge field. During inflation this coupling leads to a copious production of gauge quanta 

^ and consequently a very distinct modiflcation of the primordial curvature perturbations. 

I— I In this work we compare these predictions with observations. We flnd that the leading 

,__! constraint on the model comes from the CMB power spectrum when considering both 

K>" WMAP 7-year and ACT data. The bispectrum generated by the non-Gaussian inverse- 

ly^ decay of the gauge held leads to a comparable but slightly weaker constraint. There is also 

O a constraint from ^-distortion using TRIS plus COBE/FIRAS data, but it is much weaker. 

Finally we comment on a generalization of the model to massive gauge fields. When the 
mass is generated by some light Higgs field, observably large local non-Gaussianity can be 



o 

CN produced. 
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1. Introduction 

Inflation is a UV sensitive mechanism in that it drastically depends on the size of Planck 
suppressed operators up to dimension six or more. One of the main objectives of inflation- 
ary model building is to find natural constructions in which these operators have values 
compatible with current observations. If we now further focus on inflationary models that 
can produce observable tensor modes, in which the scale of inflation is close to the GUT 
scale, the UV sensitivity becomes even more pungent: there is an infinite number of higher 
dimensional operators that need to be small in order for inflation to take place at all. Fac- 
ing these obstacles, the use of symmetries constitutes a very promising way to construct 
successful and robust models. 

Axions enjoying a shift symmetry to all orders in perturbation theory have been recog- 
nized long ago [1] as promising inflaton candidates. Since then many different constructions 
have been proposed [2,4]. Without getting into the very interesting and challenging de- 
tails of model building, our focus here is on phenomenological consequences that all this 
class of models have in common. The assumption of a shift symmetry already puts strong 
constraints on the allowed interactions of the inflaton. One shift-symmetric interaction 
that has so far proven to be extremely relevant for observation is a (pFF coupling of the 
inflaton to a gauge field. On the one hand, this coupling can be responsible for a non- 
perturbative periodic correction to the inflaton potential^ [4]. The consequences of this 
oscillatory correction have already been discussed elsewhere [4,5,7] and will not be further 
discussed in the present work. On the other hand, during inflation the cpFF coupling leads 
also to a window of tachyonic instability for the gauge fields, which are then copiously 
produced. There is a regime in which these gauge quanta are abundant enough to affect 
the background dynamics at the time when CMB scales left the horizon as for example 
in warm inflation [3,12]. We will not consider this regime since it requires non-minimal 
modifications in order to be compatible with the data. Instead, following [29] we focus 
on the more conventional regime in which most of the exponential expansion is driven by 
standard slow-roll inflation and the gauge quanta only affect the perturbations. We allow 
for some additional e-foldings driven by dissipative effects as long as this takes place at the 
end of inflation, i.e. for scales too small to be observable. 

There are several observational channels. Through an inverse decay process two gauge 
quanta can produce an inflaton perturbation hence affecting all scalar primordial N-point 
functions [29] (to be reviewed in section 2). The effect on the power spectrum is a blue- 
tilted contribution, which grows on smaller scales. Since the inverse decay produces non- 
Gaussian inflaton perturbations, the bispectrum is non-vanishing and peaks on equilateral 
conflgurations. Gauge fields also source tensor modes. The effect is negligible at CMB scales 
but grows on smaller scales, such that it is observable by gravitational wave interferometers 
such as Advanced LIGO/Virgo [10,11]. 



^The origin of tiie non-perturbative correction in explicit string constructions is somewhat different, but 
leads to a similar low energy effective theory. 



COBE/FIRAS 



flat prior ^* < 3.6 

log prior ^=k < 3.2 



WMAP7 fNL 



C* < 2.45 
C* < 2.22 



WMAP7 



^* < 2.66 
C* < 2.51 



WMAP7+ACT 



C. < 2.41 
C* < 2.15 



Table 1: The table summarizes the various 95% CL constraints we derived in this work for the 
quadratic model of section 1.1. 



WMAP7 



flat prior S,* < 2.5 
log prior S,* < 2.18 



WMAP7+ACT 



g. < 2.5 
C* < 2.14 



Table 2: The table summarizes the various 95% CL constraints we derived in this work for the 
generic model of section 1.1. The flat prior is both less theoretically motivated and gives rise to 
convergence issues discussed in section 5.1, so one should mainly focus on the results for the log-flat 
prior, as far as the generic model is concerned. 



1.1 Summary 

For the reader excited mostly about our main results rather than the details of the analysis, 
we commence this paper with a summary. We compare the predictions of axion inflation 
allowing for a shift-symmetric coupling 
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with observations. Here / is the axion decay constant with dimensions of mass and a is the 
dimensionless size of the coupling. Let us stress that in order to have successful reheating 
the inflaton should couple to something. The particular coupling in (1.1) is a natural one 
to consider since it is allowed by all the symmetries of the theory. 

As mentioned in the introduction, gauge perturbations are copiously produced during 
inflation due to the time variation of the inflaton (p. We focus on the phenomenological 
consequences of the inverse decay of gauge quanta (A) into inflaton perturbations {S(p). 
We present our final results as constraints on the combination f /{Mpu) or equivalently on 
the parameter 
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For concreteness we quote the bounds on ^^ = ^{K), where fc* = 0.002 Mpc^^ is our choice 
of the pivot scale. We consider two different theoretical priors. In both cases the choice 
of the upper bound is mostly irrelevant^ as long as it is much larger than S,* ~ few, since 
the posterior distribution drops to zero extremely fast for C* > 3 (see figure 10). The two 
priors we consider are: 



^The upper bound is relevant if one wants to compute the Bayesian evidence. In all cases we consider, 
the evidence for a model containing the effects of gauge production is smaller than the evidence of the 
standard ACDM model as can be seen from the marginalized likelihood for ^« in figure 10. 



Flat prior: Constant prior on S,* in the interval {0 — 10}. This is more practical for 
the data analysis but is theoretically less motivated. In terms of physical parameters this 
choice corresponds to a flat prior on the axion decay constant, but we have no reason to 
believe that this should be fixed at some precise scale. 

Log- fiat prior: Log-constant prior on ^=k in the interval {10~^ — 10^}. We feel that this 
choice is better motivated theoretically. It corresponds to having some fixed a ~ 0(1) and 
a log-flat distributed axion decay constant in the interval^ (2/3) x 10^'^ < f/Mp < 2/3. 
The lower (upper) bound of this ^=k {f /Mp) interval is motivated by the fact that super- 
Planckian axion decay constants seem hard to obtain in UV finite theories of gravity [13]. 
The lower bound is somewhat arbitrary. 

For the comparison with the data we take the power spectrum to be [29] 

A|(fe) = Al^^^ik) [l + Al,,{k) /2(0 e^-«] , (1.3) 

where 

{n{k)n{k')) = {2T:fPn{k)5^ (k + k'') = (27r)3 ^^'^^^^^% 3 {k + k'') , (1.4) 

Al^^ik) ^ Al{k)\.-o = f— V S^ = ^*. (^Y~^ , (1-5) 

^'''^ ' ^^ ^'^ \27tJ |0|2 (27r)2|,/,,|2 \kj ^ ' 

and /2(0 is defined in (A. 18) and is handled numerically in the data analysis. Here all 
time dependent quantities should be evaluated at horizon crossing of the relevant mode 
and a star refers to the pivot scale /c* = 0.002 Mpc~^. The slow-roll expansion gives a good 
approximation already at linear order 

Us — 1 = -2e* - r]* ~ Gey + 2r]v , (1.6) 






+ 0{e'), (1.7) 



where 

e = -H/H\ ,^ = e/{eH), (1.8) 

while potential slow-roll parameters are defined by 

e^-^(yj , r^v^Ml-. (1.9) 

In order to numerically evaluate (1.3) we need to specify an inflaton potential. In the rest 
of our analysis we consider two models: 

Quadratic Model: We assume a quadratic inflaton potential y{<\>) = m?4)^ /2 and fix 
the number of e-foldings A^ between the end of infiation and when the pivot scale A;* leaves 
the horizon to be A^ = 60. We can then solve the background homogeneous equations of 
motion and compute H, (j){t) and £,{t). This fixes the slow-roll parameters in (1.7). Then 
the primordial power spectrum (1.3) contains only two parameters: {^*, A^gj,(A;^,)}. 



^The factor of 2/3 is chosen to get a nicer interval in ^, . 



Generic Model: Instead of specifying a potential, we can vary the slow-roll parame- 
ters in the fit. The two additional parameters can be chosen to be {e*,r/*} or equivalently 
{ns,r} in (1.7). In this model the primordial power spectrum (1.3) is described by four 
parameters: {^*, A^g^(/c*), e*,r/*}. 

Currently the strongest bound on the inverse decay phenomenology comes from WMAP7 
[30] plus ACT [41] data (see section 5.1 for details). For the quadratic model this is 
^* < 2.15 or equivalently f/a > 0.032Mp at 95% CL for a log-flat prior. For a flat prior 
the same data leads to ^* < 2.41 or equivalently f/a > 0.035Mp at 95% CL. The model 
independent constraints are remarkably similar. As originally derived in [29], the current 
bound coming from non-Gaussianity can be estimated from the analysis of the equilateral 
template (with which the inverse decay template has a cosine of 0.94). As discussed in 
section 4, we find ^* < 2.45 and ^^ < 2.22 at 95% CL for a flat and log-flat prior, respec- 
tively. We also consider CMB spectral distortion of the /^-type and find that using the best 
current data from COBE/FIRAS leads to a constraint which is much weaker than those 
from temperature anisotropics. For convenience we have collected all the bounds in tables 
1 and 2. The forecast constraints from Planck and ACTPol are summarized in table 3. 

Our analysis shows that in the region of parameter space allowed by constraints on 
the scalar power spectrum, non-Gaussianity from inverse decay is currently unobservable. 
On the other hand, in the case of a detection of an upward bent of the power spectrum at 
small scales, inverse decay non-Gaussianity would be an important signal to be looked for 
in the data^. 

Finally in section 7, we consider a generalization of the model in which we allow for 
the gauge field A to get a mass from the vev of an additional Higgs-like field h. Quantum 
oscillations of this extra field can convert into adiabatic curvature perturbations. The 
phenomenologically interesting (but technically unnatural) regime is when h is light as 
compared to Hubble. In this case large non-Gaussianity of the local type can be produced. 
The constraint on ^* then depends on a combination of the vev of h, the scale of inflation 
and the gauge coupling. 

2. Review of inverse decay in axion inflation 

In the section we review how the presence of the coupling (1.1) affects the curvature 
perturbations. We focus on the results relevant for our analysis and refer the reader 
to [10, 29] and appendices A and B for further details on the derivation. We consider 
the effective Lagrangian 



S = — d^x-v/— ^ 



^^R+^^d^^ +^-F' + af-^FF + V{<t>) 



(2.1) 



where the inflaton <j) enjoys a shift-symmetry to all orders in perturbation theory. We 
assume that the gauge coupling is such that non-perturbative effects are negligibly small 

*This is somewhat analogous to the results presented in [7] , albeit for a different set of phenomenological 
signatures of axion inflation, where the authors point out that a periodic modulation of the inflationary 
potential lead to a parametrically larger signal in the power spectrum compared with the bispectrum. 



and refer the reader to [4,5,7] for the study of a different regime. Assuming a homogeneous 
and sfowly evolving background, one can solve the equations of motion for A. This solution 
can be used to compute the contribution of the gauge field to the homogeneous equations 
of motion 

(P + 3H(I) + V' = -{E-B), (2.2) 

3H^M^ = ^cj)'^ + V + ^{E^ + B^). (2.3) 

where the two expectation values on the right-hand side can be estimated to be 

(E-B)::, -2.4 • 10-^ ^ e^< , {^^±^) :^ 1.4 • IQ-^^e^-? , (2.4) 

with ^ = a(/)/{2fH). One can check that the contribution to the (p equation of motion 
(2.2) is always more important than the one to the Friedman equation (2.3). Given that 
^ oc \/2e, it generically grows as inflation proceeds. When ^ > 5 the gauge field starts back- 
reacting sizably on the homogeneous evolution. Here we consider the regime in which ^ ^ 3 
when the observable CMB scales left the horizon, in which case the effects of the gauge 
fields are relevant only for the perturbations. In general ^ will grow large towards the end 
of inflation, acting as an effective friction term for (f). This generates additional e-foldings 
that we consistently take into account by solving the equations of motion numerically as 
explained in [11]. 

To compute the primordial curvature perturbations one needs to solve a linearized 
equation for the Sep perturbations. Schematically one flnds dcp = dcpsr + ^(pid, where the 
former contribution is the one of standard for slow-roll inflation while the latter is generated 
by the inverse decay of gauge field perturbations. Converting to curvature perturbations 
7^ (see (A.9)), one finds the power spectrum [29] 

{n{k)n{k')) = {27rf6 (k + fc') Pn{k) = {2TrfS (k + k'^ ^"^ ^3^^^^ , (2.5) 

A|(fe) = A'^,sM [1 + A^^M /2(0 e^-«] , (2.6) 

where /2 is defined and discussed around (A. 18) and 

A^ m= ^' - ^- rM"" (2 71 

^'"■"^ ^ (27r)2|0|2 (27r)2|</,,|2 \kj ^ ■ ^ 

is the power spectrum in the absence of a coupling to gauge fields. We remind the reader 
that here and in the rest of the paper we denote with a star a time-dependent quantity, 
such as^ H{t), (J){t) or e(r), evaluated when a certain fixed pivot scale k^, crosses the 
horizon. All time dependent quantities without a start should be evaluated at horizon 
crossing of the relevant mode, i.e. when —rk = 1. Notice that what is most crucial for the 
comparison with observations is the scale dependence of (2.6), since the overall amplitude 



^r is conformal time defined hy a dT = dt. 



is undetermined in the inflationary mechanism. The largest violation of scale invariance 
comes typically from the scale dependence of ^, given in (1.7). For the suspicious reader 
we have collected in appendix A some computational details leading to (2.6). 

One can also compute the bispectrum [29]. As argued in appendix B we will consider 



{-R") = ^(2^)' 



J[e'<^'^^^l^^Ah) 



.1=1 



KY. ^«)S||/3(^' ^2Ai, fcg/fci). (2.8) 



n^f 



/3(C) ^2/^1, ^3/^1) is a function similar to /2 first derived in [29] and the full expression 
can be found in appendix B. 

3. Constraints from spectral distortion 

In [21] it was pointed out that bounds on (or measurements of) the /x-type spectral dis- 
tortion of the CMB provide a direct bound on (or measurement of) the log-integral of the 
primordial scalar power spectrum in the approximate range 50 < /c x Mpc < 10^. Recently 
this and other aspects of spectral distortion have experienced a revival [8,9,16,17,19]. 
Some useful older references are [22,23]. Since the inverse decay of gauge fields into scalar 
perturbations leads to a blue tilt in the primordial power spectrum it is natural to ask what 
the constraint from ^-distortion is^. We show in the following that the current constraints 
(from TRIS [14] COBE/FIRAS [15]) are much weaker than those from the CMB power 
spectrum and bispectrum, as we will see in the next section. For this reason we limit the 
analysis to the quadratic model. The main conclusion similarly holds for a generic model. 

3.1 Review of /i-type spectral distortion 

We start with a quick review of the relevant CMB physics following [19-21]. At early 
times, 2; S> 2 X 10^, the occupation number of photons is extremely well described by 
a black body formula. As the universe expands and cools down, interactions become 
less efficient. After z ~ 2 x 10^ equilibrium relies mostly on elastic Thomson scattering. 
Since Thomson scattering takes one photon in and gives one photon out, effectively the 
number of photon becomes frozen. Let us now consider perturbing the system by adding 
some energy 5E. Thomson scattering redistributes the energy (kinetic equilibrium), but 
there is a conserved number, so the equilibrium distribution is a Bose-Einstein distribution 
with a non- vanishing frequency-dependent chemical potential ^{v), rather than a black 
body spectrum. A Boltzmann equation for ^i{i') can be written down (Kompaneets [25]) 
and solved. The result is that /i(i^) is almost constant except for very low frequencies, 
where light photons can be produced at low energy cost. As the expansion of the universe 
proceeds, even elastic Thomson scattering is not efficient enough after z ~ 5 x 10^ to 
redistribute energy and momentum if the system is perturbed. There is a smooth transition 
to a situation in which an energy injection into the system produces a deformation of 
the photon spectrum quite different from the one due to //-distortion, which then can be 



®Close to the completion of our work, [8] appeared where analogous constraints are computed for a few 
other classes of models. 




10" 



0.01 



1 100 

k [Mpc-^] 



10' 



Figure 1: The plot shows the primordial scalar power spectrum including Silk damping at three 
different times. More specifically we plot A^(fc)e~'^"/'^"(^' for z = 1100, 5 x 10**, 2 x 10^. Measure- 
ments of ^-distortion are sensitive to the red region on the right side, i.e. much smaller scales than 
those probed by large scale structures and temperature anisotropics (on the left). Figure taken 
from [17]. 



distinguished observationally. Finally after the photons decouple, around z ~ 1100, any 
imprinted distortion is maintained and can be observed in the CMB. 

The energy injection and associated spectral distortion that we want to consider is the 
one due to dissipation of adiabatic acoustic waves. As primordial superhorizon perturba- 
tions re-enter the horizon during radiation domination they start oscillating and eventually 
they dissipate part or all of their energy due to diffusion damping [26]. The energy that 
is dissipated and hence injected into the photon-baryon plasma from Zi = 2 x 10^ to 
Zf = 5 X 10^ leads to ^u-distortion. 

3.2 Constraints on inverse decay from /i-distorsion 

Very precise predictions for /^-distortion are best obtained by numerically evolving the 
Boltzmann equations for the photon-electron-baryon plasma. Here, on the other hand, we 
will make the necessary approximations in order to be able to treat the problem analytically. 
We believe that our main conclusions, i.e. that the bound from /i-distortion is weaker than 
those from temperature anisotropics, are unaffected by these simplifications. The relations 
we will use is [21] 



/i ~ 3 



=D,/ 



d\nkA^{k) 



'kyki(z) 



(3.1) 



0.001 




Figure 2: The plot shows /i as function of ^*. /i < 6 x 10 ^ is the 95% CL exclusion contours 
from the combination of TRIS [14] and COBE/FIRAS [15], while from forecasts of the PIXIE 
experiment [28] one gets /i < 2 x 10"^ at 95% CL. 



where kfj^z) ~ 4 x lO^^z'^' ■^ Mpc~ is the damping scale. As we reviewed in section 2, the 
primordial scalar power spectrum in models of axion inflation accounting for inverse decay 
of gauge fields is given by (2.6). 

Using (2.6) and (3.1) we derive the dependence of //-distortion on S,* for the quadratic 
model of section 1.1. This is plotted in figure 2 together with the most recent constraint 
from TRIS [14] plus COBE/FIRAS [15], i.e. /i < 6 x lO'^ at 95% CL, and the constraint 
forecasted for an experiment like PIXIE [28], i.e. fi < 2x 10~® at 95% CL. More specifically, 
the plot shows fi as function of ^*, i.e. the value of ^ when the CMB pivot scale /c* = 
0.002 Mpc~ left the horizon during infiation. In order to translate the constraints from 
TRIS plus COBE/FIRAS, we assume that the marginalized likelihood for /i is a Gaussian 
centered in zero with variance o"^ = (3x 10~^)^, corresponding to the constraint fi < 6x 10~^ 
at 95% CL. Given that the model only gives positive //-distortion, we set the likelihood to 
zero for // < 0. Using a flat prior on ^=|,, as discussed in section 1.1, the 95% CL constraint 
from TRIS plus COBE/FIRAS is ^c < 3.6. Using a log-flat prior in the interval {10-^ lO^}, 
one finds ^* < 3.2 at 95% CL. 

An experiment like PIXIE, whose expected sensitivity is /i ~ 2 x 10~® at 95% CL, 
would be able to detect the /i-distortion coming from this model. On the other hand, it 
would still not be possible to obtain a significant detection of non-vanishing ^* from just 
this probe. The reason is that, even taking a fiducial signal as large as allowed by current 
data ^4, = 2.41 (see section 5.1), the amount of /x-distortion is not significantly increased 
by the presence of the inverse decay effect as compared to the standard slow-roll case. 

Summarizing, the constraint from /t-distortion is currently weaker than those from the 
CMB spectrum and bispectrum to which we turn next. 



4. Bispectrum constraints 

In this section we estimate the constraint on ^* from the CMB temperature bispectrum. 
We follow the approach of [29] where the constraint from non-Gaussianity was first derived. 



We make a few different technical choices, which reflect in shghtly different but comparable 
results. For the flat prior (see section 1.1) on ,^* we find ^* < 2.45 at 95% CL, corresponding 
to /^^ < 13. For the log-flat prior (see section 1.1) on ^* we flnd ^* < 2.22 corresponding 
to /^^ < 0.4 at 95% CL. 

We gave the result for the primordial scalar bispectrum in section 2 (details on the 
derivation are left to appendix B). By inspection the inverse decay bispectrum is largest 
in the equilateral limit, i.e. when /ci ~ A;2 ~ ^3. So instead of performing a dedicated bis- 
pectrum analysis we import and appropriately normalize the constraint on the equilateral 
non-Gaussian template. As first discussed in [29], this can be done following [31], i.e. by 
computing the three-dimensional overlap of the two different shapes. This approach gives 
just an estimate of the non-Gaussian constraint on S,*, but given the similarity of the two 
shapes we expect it to be reasonably accurate. 

Let us start with a bispectrum 

(7^(fcl)7^(fe2)7^(^3)) = {^nfs ( J^^^J fNLFikiMM). (4.i) 

To fix the ambiguity of moving numerical factors between F and f^L in this definition we 
impose 

F{K,K,k,) = &\Pn{Kf, (4.2) 

5 

where k^ = 0.002 Mpc~ and the numerical factors are chosen to agree^ with the way /jv^ 
is defined in numerical analysis, e.g. [30,36]. Before applying this to (2.8) let us consider 
what the leading deviations from scale invariance are. /a, discussed in appendix B, depends 
in principle on each one of fci, k2 and fca separately. There is also a dependence on /cj in 
g67r5 and A^gj,(A;j), where the scale dependence of ^ to first order in slow roll is given in 
(1.7). Let us estimate the dependence on an overall change in scale, affecting all ki in the 
same way. For various terms appearing in the three-point function one finds 

Q ^^ Q 

log /s - ^7^/3 ~ -4??* , (4.3) 



d log ki H d^' 

-^loge6-€W =3<,r7,, (4.4) 

dlogki 

^ log A|.,,(A:) = 3(n, - 1) = -3(2e. + r/,) , (4.5) 



d log h 

where for the estimate in (4.3) we have used the large-^ limit /s oc ^~^ derived in [29]. In 
the regime of interest^ ^ ^ 2 and e^, ~ 77^, ~ 10~^, the exponential generates the strongest 
running. The second strongest running comes from the power spectrum. For example, 
in the slow- roll quadratic model e^. = rjy = 77* /2 and hence the scale dependence can be 
quickly estimated using ry* ~ 2e* — r/y ~ e*. Finally, f^ induces the mildest running. One 

''One needs to take into account the conversion factor (" = 3"I>/5. 

*It is true that in our analysis ^, is varied all the way to zero, but all the effects of the inverse decay are 
exponentially suppressed for 5* < 2, so in that regime the small corrections due to running are irrelevant. 
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can therefore safely neglect the scale dependence in f^ which can then be approximated by 
/aC?*) ^2/fci) ks/ki) defined in (B.9). On the other hand, in the following we will keep the 
scale dependence in e^'^^ and A^^^{ki). 

Using the normalization (4.2) and (2.8) one finds 



fid 
JNL 



Tiid 



^n,sv\ 



)e'"«*/3(e*,l,l) 






n 



, \ TT£,tri,+ns-l 



(4.6) 
(4.7) 



where we have collected all the deviation from scale invariance inside the shape function 
F . The equilateral template used by the WMAP collaboration is defined as 



j?<^<} 



10 



(27r)M^(A;,)3 



1 



kik2 



pl.2^,2 



12 V * 

k^ 



n.s-l 



2 perm's+ 



(4.8) 



+ 



"'I '^0 '^'^ 



-L L2p\ K-l)/3 

rvj^ Ajo rvn 



+ 5 perm's 



and has been constrained to -214 < f^^ < 266 at 95% CL using WMAP 7-year data [30]. 
A normalized scalar product, also known as "cosine", can be defined [31] between two 
shapes Fi and F2 by 



cos(Fi,F2) = 



Fi ■ F2 



V-Fi • -F1V-P2 ■ F2 



(4.9) 



where the dot product is defined to mimic the scaling^ of the optimal three-point function 
estimator [18] 



F-i ■ Fo 



dkidk2dk2, 



{kik2k3 



ki + k2 + h- 



Fi{ki,k2,k3)F2{ki,k2,k3 



(4.10) 



Here /cmax and fcmin are the smallest and largest scales relevant for a given experiment. 
The cosine can be used to import bounds from one shape, e.g. the equilateral template, 
to another, e.g. the inverse decay shape, which has not yet being compared with the 
data. After appropriately taking into account the potentially different "volume" of the two 
shapes, from a given constraint on f^^ one deduces a constraint on /^^ using [31] 



^fh^L 



Ar 



NL 



cos(Fe'?^i?*d) V F^^.F**^ • 



peg . peg 



(4.11) 



Numerically one finds cos (F^*?, F*'') = 0.935 and F'"! ■ F^V^''^ • ^''^ ~ 5. While the 
result for the cosine is very robust, the ratio of the volumes depends mildly on /cmax- 
As we vary it e.g. from to 0.03 to 0.05 Mpc~ (corresponding to I ~ 400 and / ~ 700, 



^Numerically one finds that introducing or not the weight factor (fci + k2 + ks) ^ changes the result less 
than 1%. 
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Figure 3: As function of ^*, the plot shows fj^j^ (continuous black line) and the likelihood pro- 
duced by the constraints on non-Gaussianity (dashed red line). For ^* < 2.4, Pj^j^ ^^ and the 
likelihood is completely flat. For ^* > 2.7, /^^ 3> 100 and the likelihood drops to zero. The 
small peak corresponds to /^^ = 12 (thin dashed line), which is the central value for equilateral 
non-Gaussianity rescaled by the fudge factor in (4.11). 

respectively), the volume ratio changes from 4.3 to 5.3. For the following estimates we 
take F'^'i ■ F'^i^p^d, . jpid _ ^^ corresponding to the largest range of scale relevant for 
WMAP, which results in the most stringent bound we could possibly set. Even with this 
conservative estimate, as we will see, the constraint is still weaker than the one from the 
power spectrum. 

Using these results, the 95% CL constraint on the inverse decay template is —100 < 
fNL ^ ^'^'^- Notice that we have rescaled both the error bars and the central value by the 
fudge factor in (4.11), so that the statistical significance of the Gaussian null hypothesis is 
not changed^*^. Despite the fact that the constraint on /^^ is weakened with respect to that 
on f^j^ by the cosine being smaller than one, the inverse decay shape has five times more 
"volume", resulting in an overall stronger bound than that for the equilateral template. 
Let us emphasize that the bounds quoted above assume a flat prior on /nl- This is a very 
reasonable prior when looking in the data for any deviation from the standard picture. On 
the other hand, in this work we are considering a specific class of models, axion infiation, 
and hence we have theoretically motivated priors which are quite different (see section 1.1). 

To a good approximation the likelihood of f}^]^ is a Gaussian with standard deviation 



56 and centered at / 



id 
NL 



12. In order to derive a bound on the parameter ^* 
using fj^^ (^* ) , we have to take into account the fact that in the model under consideration 
fl^liC*) is strictly positive. We do this by setting the likelihood to zero for /^^ < 0. For 
a flat prior on ^*, we integrate ^(/^/^(C*)) ^^ d^* over the interval {0 — 10} . The resulting 
bound at 95% CL is ^* < 2.45 corresponding to /^^ < 13. For the more theoretically 
motivated log-flat prior (discussed in 1.1) we integrate vC(/^£_(^*)) in dlog.^* over the 



^°This is important since otherwise the constraint on ^* would depend on the normalization of _F"^, which 
is unphysical. 
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interval {10"\l02}. We find ^* < 2.22 corresponding to fj^^ < 0.4 at 95% CL. As we 
will see in the next section, these bounds are weaker than those obtained from the CMB 
temperature power spectrum. 

The reader might wonder why the bounds on f^j^ are much stronger assuming a flat 
and even more a log-flat prior on S,* rather than a flat prior on fiy^ as done in e.g. [30,36]. 
The reason is clear. For most values in the space of fundamental parameters, say a and / 
or equivalently ^=k, the model gives either no non-Gaussianity or is completely incompatible 
with the data. Only in a narrow window around i^=k ~ 2.5 can one have a potentially 
observable signal that has not yet been ruled out. So, within this model, f^^ ~ 100 is 
rather unlikely compared to /)^^ ~ or /]^^ > 10^. 



5. Power spectrum constraints 

In this section we constrain ,^* by comparing the theoretical predictions for the temper- 
ature power spectrum with CMB data from the Wilkinson Microwave Anisotropy Probe 
(WMAP) and the Atacama Cosmology Telescope (ACT). We considered both the quadratic 
and generic models introduced in section 1.1. The results are summarized in tables 1 and 
2 for each model, respectively. Our main flnding is that the strongest constraint comes 
from WMAP + ACT power spectrum for both the quadratic and generic model and that 
the constraint is stronger than the constraint from non-Gaussianity. 

5.1 Analysis 

We modified the pubhcly available version of COSMOMC [37] and CAME to run with the 
power spectra discussed in section 1.1 for the quadratic and generic model, respectively. 
When running the Markov Chain Monte Carlo (MCMC) we took a conservative Gelman 
and Rubin bound ofi? — 1 < 0.01 [40]. The Gelman- Rubin diagnostic R relies on parallel 
chains to test whether they all converge to the same posterior distribution by considering 
the variance of the parameters in each chain compared to the variance of the same parame- 
ters over all parallel chains. Convergence is diagnosed once the chains have 'forgotten' their 
initial values, and the output from all chains has become indistinguishable (i? — 1 = 0). 
In particular we aim at a sufficient convergence of S,*- To test convergence, we run 4 in- 
dependent chains. For the quadratic model we computed the power spectrum numerically 
for a sufficient number of k and /2 (as a function of .^* ) . We used a 2D spline interpolation 
inside CAME to call for arbitrary combination of k and .^*. Note that the approximation 
used to compute /2 in [11] is only valid for ^^, > 1. We used the exact Whittaker solution 
in (7.13) in order to extend the range to i^* = 0. For analyzing the generic model we use 
(1.3), to first order in slow-roll. Again, we call /2 as a function of ^* through a ID spline. 
In principle some issues could arise when analyzing the generic model of section 1.1. If 
Ug = 1 then e* = — ry*/2, which allows for the solution ?/* = e* = 0. In this case the power 
spectrum is exactly scale invariant to first order in the slow-roll expansion. Hence the 
constraint on ^* will be undetermined (the correction to the single-field slow-roll solution 
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is simply a renormalization of the total power). For WMAP and ACT^^, rig = 1 is still 
allowed by the data, which would suggest we could run into this issue when scanning the 
multidimensional likelihood. In addition, we expect slower convergence due to a doubling 
of the initial parameters {^*, Ag} — )■ {^*,A5,e*, r/*}. Another potential issue becomes 
apparent only when considering lensing. For large enough {r/*,^*} the spectrum becomes 
extremely blue tilted at small angular scales. Obviously, such a combination is not allowed 
by current data, and in principle this combination should be rejected. However, when 
considering lensing, a too large power spectrum invalidates some of the approximations 
made (e.g. small deflection angles) in the lensing computation. Therefore, typically one 
sets an upper bound A'^{kmax) < 10^^. In order not to exceed this bound, we would 
have to strictly limit our prior values of r/* and .^*. Unfortunately, we can not constrain 
the priors on r/* and ^* too much, since a large value of one of these two parameters 
alone is still allowed by the data. Hence we set a hard bound on the power spectrum of 
'^Tlif^max) < 10~^, i.e. if the power spectrum becomes larger than this, we set it to be 
10-7. 

The quadratic model generically predicts tensor modes, which we include in the data 
analysis. We assume the slow-roll result A|(A;) = 16e*A^gj.(fc)(fc/fc*)"* and nt = — 2e*. 
For the generic model, the constraints on the contribution from tensor modes to the tem- 
perature power spectrum, allows to break the degeneracy between e^: and r/*. The prior on 
r/* is set such that .^ > for all observable k. Let us now discuss the priors on (^^ in some 
more detail. 

5.2 Priors 

Before we discuss our results, let us briefly remind ourselves of the concept of bayesian anal- 
ysis. In bayesian terminology we are interested in extracting the values of our parameters 
given the data, i.e. 

P(Z)|M(0,))P(0,) 

pmD) = — j^^ — . (5.1) 

Here D represent the data, 0j are a set of parameters describing a model M. P{@i) 
describes the parameter priors , while P(D\M(Qi)) is known as the Likelihood function C, 
i.e. the probability of the measured data given the model M with parameters 0j. For the 
CMB this translates to [38] 

P(Gi|Q) = C{Ci\Ct''{ei))P{ei), (5.2) 

where Ci is the best estimator of the true Q , and Cj is the model predicted value of the 
power spectrum, given the parameters Gj. The denominator P{M) is of no interest as it 
does not depend on the parameters, in fact it equals the integral of the numerator over the 
parameter space. Consequently, it only affects the overall normalization of the posterior. 

One peculiarity of bayesian analysis is the choice of priors. Priors can be based on 
previous or independent experiments. For ^* no such independent constraint exist, but 



^^In analyzing ACT data we also include the SZ amplitude Asz, the amplitude of clustered point sources 
Ac and the amplitude of Poisson distributed point sources Ap. 
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Figure 4: The primordial power spectrum for ^* = 0, 2.68, 2.8, 3 for a quadratic potential. 

we do have some theoretical prejudices on the value of ^=i<. It is usually a save bet to put 
a flat prior, i.e. equal intervals in parameter space have equal likelihood, but it is most 
certainly not always a natural choice. In this case ^* is not the parameter appearing in the 
fundamental Lagrangian describing the theory. The microscopic parameters are instead 
the axion decay constant / and q, which are related to ^^ through 



2fH- 



(5.3) 



Within effective field theory we expect a = 0(1). Since / is an energy scale, its value 
could range several orders of magnitude. For this reason, it would make more sense to 
expect log spaced intervals to be equally likely, i.e. we should consider a log prior on ^*. 
Fortunately, as is apparent from eq. (5.2) it is straightforward to change priors after the 
computations of the likelihood through an MCMC (which is the computational intensive 
part). For completeness we consider both a flat and a flat log prior on ,^* and discuss the 
difference between the results in the next section. 

5.3 Results 

In figures 4 and 5 we have plotted both the primordial and late time power spectra for 
several values of ^* and best-fit WMAP7 values of all other ACDM parameters. Large ^* 
results in a poor fit for WMAP data. Because of the blue tilt generated by ^* > 2.5, data on 
the power on small scales should further constrain the value of ^* . These small CMB scales 
have been probed by ground based experiments such as the Atacama Cosmology and South 
Pole telescopes (ACT & SPT). We use ACT data and our results are summarized in table 
1 (section 1.1). We consistently account for the tensor contribution to the temperature 
power spectrum. From current CMB data we know that the best-fit model prefers r = 0, 
so we expect to find a worsened likelihood compared to ACDM + r for the quadratic model. 



15 



8000 



^ 6000 h 



4000 



+ 
■^ 2000 



1 1 1 


1 1 ■ 1 1 


1 


1 , ' 


' 1 


. 






/ / 


\ \ 
\ \ 


. 






/ / 


'. 1 


■ 








W ' ■ 








/ // 










/ -J 


Vl'i / 1 ' ■ i 


" 






/ iu 




- 






./# 


\Jf^h " 


T 






<y'J^ 


vu ' ■> 




T 




-^^=*^ 


nl 


■ p^^ 




i^"*'^ 


f^^ 


^i ; 


1 

1 1 


1 1 1 1 1 


1 


1 1 


1 



10 



100 



1000 



Figure 5: The late time spectrum for ^* = 0, 2.68, 2.8, 3 and using for all the other parameters the 
best fit values of ACDM (obtained for ^* =0). We also show the binned data points and errors 
(systematic and cosmic variance). 



Quadratic Model: Combining WMAP and ACT data yields the strongest constraint, 
i.e. ^* < 2.41 and ^=k < 2.21 for the flat and log-flat priors, respectively. Surprisingly, 
the constraint from WMAP alone is worse than naively expected. After inspecting the 
likelihood, we found that WMAP alone allows for a larger value of ^*. In fact, the best-fit 
point prefers ^=f = 2.68. First of all, the actual CMB data prefers no tensor modes, with 
a best fit WMAP7 value of r = 0. For the quadratic model we include a non-zero r, 
which is consistent with the models prediction. The tensor modes contribute to the low /, 
predominantly up to / ~ 200. To compensate for the increased power at low / we can fit a 
smaller value of A7^^sr(^*)'^i the primordial power. The power is reduced mostly to fit the 
first peak, because that is where the WMAP data is most constraining. Now, the third 
peak and the second peak have too much power. To solve this fifc/i^ is decreased, which 
leads to a shorter diffusion length, which equates into more damping on small scales. This 
is true in case ^^ is small. For larger ^^k, the overall power can be reduced to fit the first 
peak and have relatively less effect on the second peak. Although this might sound counter 
intuitive, note that the quadratic model has a red tilt, which causes the overall power 
enhancement in both peaks to be similar. The second peak and the third peak still have a 
little too much power (as can be clearly seen from figure 7), but fitting the second peak has 
less priority compared to the first, while the third peak is simply not measured well enough 
by WMAP. An improved measurement of the third to seventh peak with ACT reject this 
fitting, hence the relatively large improvement of the constraint on ^*. Figure 6 shows a 
clear correlation between ^* and il^/i^ for large values of ^*. A slightly smaller value of 
^7^,sr(^*)^) prefers a larger value of fife/i^ and ^*. In figure 7 we show that the maximum 
likelihood point ^=f = 2.68 is in fact a reasonable good fit to the WMAP data. We also found 
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Figure 6: The two dimensional exclusion contours from WMAP and WMAP+ACT for J7b/i^ versus 
^*. WMAP alone allows for a better fit when both ^* and ilbh^ are relatively large. This explains 
the weaker constraint. 

that this correlation practically disappears without primordial tensors, which suggests that 
the improvement becomes redundant compared to an improvement from having zero tensor 
modes. We would like to stress that there are more factors involved in this best fitting than 
we have currently addressed. For example, there is also a role of Q.DMh'^, which seems to 
be correlated with ^^k as well. 

Generic Model: As explained in section f .f for a generic model we introduce two extra 
parameters, {e^,r]^}. Using WMAP plus ACT data, we find S,^ < 2.5 and ^^ < 2.14 at 95% 
CL for the flat and log-flat priors respectively. Note that the constraint from WMAP is 
actually improved; the power in tensor modes is now a free parameter, and as explained 
the data prefers r = which removes the preference for large ^* . We did find that flat-prior 
constraints are relatively sensitive to the point e* = ry* = 0, in which case increasing S,* just 
affects the overall amplitude and is therefore degenerate with A7^^sr(fe*)^- We qualify this 
in figure 8, where we show the 2 dimensional sample plot between r;* and ^* as a function 
of the value of e*. There exist some residual points right around ry* = e=f = precisely 
due to this effect. The constraints are therefore relatively unstable. We ran multiple series 
of chains for which we found that the GR diagnostic was not satisfied despite the use of 
large samples because the chains tend to wonder around e* = ry* = for extensive periods. 
Consequently, the chains gradually 'de-converge'. This is shown in figure 9, where we plot 
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Figure 7: The best fit Ci from the MCMC using WMAP data alone for a quadratic potential. 
Despite the relatively larger value of ^ of 2.68, the increased value of fi^/i^ and smaller Ag can 
counteract this effect at small angular scales. Adding ACT, with a better hold on small scale BAO 
peaks, or putting the tensor contribution to zero will no longer prefer this weighted cancellation. 



the 'route' to convergence as a function of the total number of samples. Eventually, the 
chains do converge, but the results differ slightly from run to run. Because of the this we 
do not trust the third digit in the constraints on ^^k with the flat prior. On the other hand, 
when applying the log prior to ^=k this effect is partly circumvented, since large values of ^=i, 
are relatively unlikely. Instead of applying the log prior in postprocessing the chains, we 
sampled ^* with a log- flat prior while running the MCMC. 

After inspecting the chains with a log-flat prior on ^* we found that, on average, about 
1 chain out of 4 spoiled smooth convergence, resulting in long periods of no improvement in 
convergence as opposed to deconvergence for the flat prior. For our final constraints we only 
considered the chains that show consistent values for ^^ as a function of the percentage of 
samples we ignored, i.e. changing the fraction from 20% to 50% changed the final constraint 
.^=1, less then a few percent. 

Summarizing, the robust bounds are obtained using a log-flat prior in the MCMC and 
result in ^* < 2.18 and ^* < 2.14 for WMAP and WMAP plus ACT respectively at 95% 
CL. 



6. Future power spectrum constraints 

Planck data as well as small scale data from ACTpol will be soon available. In this section 
we forecast the constraints on ^* that this new data will allow. The main results are 
summarized in table 3. 
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Figure 8: A multidimensional sample plot, showing 'residual' samples around the line 77* = e* = 0. 
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Figure 9: The Gelman-Rubin diagnostic as a function of the total number of samples for chains 
with a flat prior on S,* (solid) and a flat log prior on S,* (dashed) for the generic model. Chains with 
a flat prior have a tendency to 'deconverge' once the chain is in the vicinity of rj^, = e^ = 0. This 
results in varying constraints on ^^ from run to run. 

6.1 Analysis 

In order to construct a mock datasets we modified an existing code by Laurance Perotto 
and Julien Lesgourgues [39], which simulates the noise and cosmic variance from spherical 
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Figure 10: The four panels show the marginalized likelihood as function of ^*. ^* > 2.5 is ruled 
out so the likelihood goes quickly to zero, while C* ^ 2 has no appreciable effect on the power 
spectrum such that the likelihood becomes constant. 

harmonic aim, randomly generated from a fiducial signal plus noise spectrum distribution 
C7 = C^ + iV;^' , i.e. 
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(6.1) 
(6.2) 

(6.3) 



Here G^ are Gaussian random numbers with variance 1 and T= temperature, E = iiJ-mode 
polarization and d is deflection due to lensing. We do not consider i?-mode polarization. 

One note of interest. We found that not implementing noise through randomly gener- 
ated spherical harmonic coefficients aim-, seems to systematically underestimate the power 
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when running an MCMC. This underestimation is particularly troublesome for our forecast 
in case of a non-zero ^* , which introduces a running that increases power on small angular 
scales. When simply adding white noise to the fiducial Ci, through a random Gaussian 
with variance 1, we found that the MCMC would consistently derive a lower value of Hq, 
driving ,^* away from its fiducial value. One explanation would be that we try to a fit a 
Gaussian likelihood to an actual likelihood that might not be Gaussian, which could lead 
to an underestimation of the true likelihood value. Given the large deviation from the fidu- 
cial input values, we did not trust the derived confidence levels. On the other hand, the 
method we use, prescribed through 6.3, does recover all fiducial input parameters within 
la, indicating that it does not suffer from the same issue. 

We have build a mock data sets for Planck and ACTpol. We have considered /fj,""'^ = 
0.85 at 3 different low frequency bands (70, 100 and 143 Ghz) and /^J^^"' = 0.6 at 148 Ghz. 
The later is the wide field survey planned for ACTpol, with a temperature noise variance 
AT = 21/iK/arcmin per 1.4' arcmin beam and yz larger for polarization. The estimated 
Planck temperature noise level for the respective frequency bands is 12.8 fiK, 6.8 /xK and 
6 fiK per arcmin beam ( 14', 10' and 7.1'). We also considered E'-mode polarization, with 
noise levels of 18.2 /iK, 11.4 fiK and 6 fiK, again per arcmin beam. 

We created two different mock data sets. First, we assume ^ = ^* = 0, e* = and 
r/* = 0.04, resulting in red tilt of 0.96. A reason to choose e* = 0, which implies no 
tensor contribution, is driven by trying to avoid r/* = e* = 0, where ^^, becomes completely 
degenerate with the power spectrum amplitude^^. Planck should be sensitive enough to 
be able to exclude Ug = 1 for an input of n^ = 0.96. Although the mock data set does not 
contain any tensor signal, in the MCMC we consistently consider the tensor contribution 
to the temperature power spectrum, which is determined by e* (or r). This breaks the 
degeneracy of e* with 77,, . 

Secondly, we consider a value of S,* near the 2a level of WMAP + ACT, i.e. 1^* ~ 2.4. 
Note that there is no need to generate B modes since B modes by themselves do not 
improve the constraint on r and hence on e^,. 

6.2 Results 

We ran 4 MCMC chains with mock data sets with ^* = for Planck and Planck + ACTpol. 
The results are shown in table 3. We find that in the absence of a signal, Planck and Planck 
+ ACTpol improve on current constraints^'^. 

Adding a fiducial signal of S,* = 2.4 to the mock datasets, is easily detected in both 
Planck and Planck + ACTpol analysis, with a very sharply peaked marginalized distribu- 
tion. There is some skewness in the error bars due to the correlation with ?/* . The addition 



^^Alternatively, one could have also chosen e, = 0.02 and r;* = 0, but we found that this choice does not 
show the stability of the first choice. 

^^If one does not fit for Asz, we found there exists some induced degeneracy between Qbh^ and ^», which 
results in a somewhat better fitting near larger values of ^, , similar to the effect we found for the quadratic 
model fitting due to tensor modes. This effect weakens the constraint. In addition, we created a mock set 
with the same experimental characteristics of ACTpol, but with Imin ~ 2. We found that even though 
the beam is much smaller for ACTpol compared to Planck, the larger error and the use of a single band, 
actually lead to a weaker constraint on ^, than Planck alone. 
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Planck 


Planck + ACTpol 




flat prior (no signal) 


e* < 2.17 


C* < 2.12 




log prior (no signal) 


e* < 1.96 


^* < 1.92 




flat prior (^^ = 2.4) 


+0.08 
-0.07 


+0.06 
-0.04 




log prior {^^ =2.4) 


+0.08 
-0.07 


+0.06 
-0.04 





Table 3: Constraints on ^* at 95% CL derived from Planck and ACTpol mock data with (^* = 2.4) 
and without (^* = 0) signal. For the mock data containing a signal we quote the 95% CL error bar 
around the maximum likelihood point. 

of flducial ACTpol data reduces the 2a error bars by about 25%. This analysis shows that 
a signal sufficiently far away from zero could be detected by the Planck satellite and adding 
ACTpol will probably allow a detection of a weaker signal (closer to zero) . Changing from 
flat to log prior does not change the constraints due to the sharp peak of the likelihood 
around ^* ~ 2.4. Running a chain with a log prior require some special caution because of 
the sharpness of this feature in the likelihood landscape; in an MCMC approach it can be 
easily missed, and one would incorrectly obtain some strong upper bound on ^^,. Even with 
a very conservative G-R bound, one should allow for a sufficient running of the chains to 
assure this sharp feature in the likelihood is recovered. A good diagnostic for the log-flat 
prior having missed the signal is an anomalously weak upper bound on ^*. We conclude 
that it is advisable to run both a flat and a log prior and check for consistency. 

The analysis in this section supports the fact that current constraints are consistent 
with ^=K = 0, since a mock data set with signal yields very similar results as the analysis 
of the real data. 

7. Massive gauge field 

It is interesting to ask what happens if the gauge field coupled to the inflaton gets a mass. 
In this section we consider a model in which this happens due to a Higgs-like held h that 
develops a vacuum expectation value (vev) . Perturbations of h affect the efficiency of the 
tachyonic enhancement of A which in turns changes the number of e-foldings. Therefore 
perturbations in h are converted into curvature perturbations and can affect the late time 
observables. We point out a regime of parameters in which observably-large local non- 
Gaussianity can be generated remaining compatible with current data. 

7.1 Equations of motion 

Let us start generalizing the action (2.1) with the addition of a complex scalar field h 
charged under A with some potential W{h) 

— / 



\id<f>f + \Dh\^ + \f' + f^FF + V{<P) + W{h) 



d^x. 



-9 



(7.1) 



where D^ = d^ + ieA^. Notice that it is natural for the scalar potential to be additively 
separable because of the shift symmetry of (j). For the effects we will consider, mixing 
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with gravity gives just a small correction so we make the approximation of an unperturbed 
metric. This is tantamount to work in the spatially flat gauge, disregard vector and tensor 
modes and neglect the slow-roll suppressed interactions coming from the solution of the 
GR constraints. Taking the metric to be unperturbed FLRW, the six equations of motion 
plus one constraint are 
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(7.2) 
(7.3) 
(7.4) 

(7.5) 
(7.6) 



with the totally antisymmetric tensor e'^i^s _ _j_-|^_ Focussing on the constraint, and choos- 
ing the Coulomb gauge V • ^ = 0, one finds 



adfA° = ^V(p ■ V X A + 2e'^A^\h\'^ + 2elm ( hh' 



(7.7) 



We now consider the case in which W{h) has a minimum for some \h\ = const and study the 
dynamics of the gauge field around the homogeneous background (j) = 4>{t) and \h\ = const. 
The following derivation parallels the ones in [29]. The constraint (7.7) is solved by 
A^ = 0. Then the equations for the spatial components become 



A" - dfA- ^V X 1+ 2e2|/i| Vi* = . 
/ 



Let us quantize the gauge field as 



Aix,t)= Y. 



r=+, 
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(27r)2 



(7.8) 



(7.9) 



ar{k)Ar{k,t)e^'^'^er{k) +h.c. 
where the polarization tensor obeys k ■ e±{k) = Q, k x e±{k) = ^ike±{k), e±{—k) = e{k)* 

ar{k),al{k')] = {27rf6rr'6'^ (k - k') . (7.10) 



and (^ ■ Er' = drr'- Also 



(7.8) can be rewritten as 



.2^2/ce , rn\ 






^±=0, 



(7.11) 
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Figure 11: The plots show \A+\ in (7.14) as function of —kr around horizon crossing —kr = 1. 
In the left panel we have taken ttia = and ^ = 0, .2, .4, .6, .8, 1, in order to show the tachyonic 
enhancement. In the right panel we choose ^ = 3 and mj^/H = .2, .5, 1.5, 3 in order to visualize 
how TUA leads to a decay outside of the horizon and quenches the tachyonic enhancement already 
around horizon crossing. 

where we defined m\ = 2e^|/ip. Without loss of generality, we assume .^ > 0. Consequently 
the only mode that can undergo a tachyonic enhancement is A^. We will consider this 
mode only and we will drop the subscript from here on. 

Before proceeding to find a formal solution of this equation it is useful to pause and 
understand the qualitative effect of the various parameters on the solution. Let us rewrite 
(7.11) using y = log(— r) 



A 



vv 



Ay + 



m. 



{eykf-2{eyk)i+^ 



A = Q. 



(7.12) 



For mA = 0, which is the model studied in [29] and in the previous sections, A undergoes an 
enhancement (exponential in ^) around horizon crossing, before freezing out. This and the 
behaviors discussed in the following are confirmed in figure 11, which displays the numerical 
solution of (7.11). Intuitively this is analogous^^ to a (damped) harmonic oscillator with 
an imaginary frequency uj^ ~ (e^/c)^ — 2(e^A:)^, i.e. a tachyon. Far outside of the horizon 
e^k = —rk <^ 1, the friction term in (7.12) takes over and the solution asymptotes a 
constant. For small masses, i.e. rriA ^ H^ the exponential enhancement still takes place, 
but far outside of the horizon A decays slowly (as an overdamped harmonic oscillator), 
A ^ T^ . As we increase ruA for fixed ^ two things happen. First, the tachyonic instability 
becomes smaller and smaller and eventually disappears when ruA/H > ^. Second, the 
mode decays fast far outside of the horizon, A ~ e^' ^/c. 

A formal solution can be found by re- writing (7.11) in the form of the Whittaker 
equation 






1 A 

4 z 



1/4 -/.2 



z" 



Wx,f,{z) = f), 



(7.13) 



^^Actually around and after horizon crossing, e^k = — rfc < 1, the frequency varies non-adiabatically, so 
this discussion sliould serve just to lielp intuition. From tlie exact solution presented in the following one 
can verify explicitly the quahtative behavior discussed here. 
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where W\^^{z) is the Whittaker function. Using the large \z\ asymptotic behavior Wx^fj_{z) — )■ 
g-2/2^A g^^^ ^]^g Bunch-Davies vacuum A-^^ ~ e~^^'^ /v2k, one can fix both integration con- 
stants in the general solution of (7.11) 

A+ = ^W.,^,,{2ikT), (7.14) 



where fi^ = 1/4 — m^/H"^. 

7.2 E-foldings 

In this subsection we want to estimate how the total number of e-foldings is affected by 
m^ = 2e^|/ip. This will allow us in the next subsection to estimate the conversion of h 
into curvature perturbations using the 6N formalism. 

The quantities that affect the total number of e-foldings N are those appearing in 
the homogeneous equations (2.2) and (2.3): —FF/A = E ■ B and E"^ + B^ . They can be 
computed from 

\(S' + S') = -^^j^dkk' [l/l'+l^ + tV+l'] ■ (7.16) 

where E = —A! jo} and B = V x A/a^. Using the variable x = —kr, the solution (7.14) 
and the fact that the background is very close to de Sitter, which sets a ~ —1/{Ht), we 
can rewrite these integrals as 

{E-B) = ^e^^l dxx^d,\W_i^^^{-2ix)\\ (7.17) 

-{E^ + B^) = ^e«M dxx' [\d^W.^^,^{-2ix)\^ + \W.^^,^{-2ix)\''] . (7.18) 

This form makes it clear that both quantities are slowly varying functions of time through 
H(t) and ^(t). Both integrals are UV divergent, as could have been expected since we 
are multiplying fields at the same spacetime point. This UV divergence has nothing to do 
with the gauge field production, which takes place near the horizon exit of every mode. 
Following [3,29] we regularize these integrals by restricting the integration over the interval 
< X < 2,^. One can check that {E ■ B) is always more important during inflation than 
{E"^ + B"^) so that the latter can be neglected. In figure 12 we plot {E ■ B) as function 
of mA/ H and ^. As previously argued, large tachyonic enhancement takes place only for 
C > mA/H. 

What we have learned so far allows us to understand how the total number of e-foldings 
N depends on the parameter vriA/ H. During infiation ^ evolves slowly, being proportional 
to \/2e. The regime we study in this paper is when ^(r) is small enough that {E • B) is a 
negligible correction to the background dynamics at the time cosmological perturbations 
k < 1 Mpc~^ left the horizon. By the end of inflation though, ^(r) can grow such that 
{E ■ B) starts backreacting on the homogeneous evolution by slowing down the infiation. 
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Figure 12: The plot shows log ( {E ■ B)a^ J computed numerically as function of ttla/H and S,. A 
sizable contribution is generated only when ^ > niA/H. 

If N^: is the total number of e-foldings between the end of inflation and when some pivot 
cosmological scale A;* left the horizon, we define 

AN{^,mA) = N^i^niA) - N4C = 0,mA = 0). (7.19) 

For example, in the case of a monomial inflation potential such as m?'(fp' and niA = one 
finds /S.N{mA = 0) ~ 0(10) additional e-foldings due to this strong backreaction regime. 
On the other hand, as we can see from figure 12, if we increase niA/H from zero to ^, 
the tachyonic enhancement of {E ■ B) disappears and the additional e-foldings with it, 
i.e. AN{mA/H ^ ^) ^ 0. Addiontionally we also know that {E ■ B) and therefore AA^ 
depend quadratically on niA- Putting all of this together we deduce that lS.N[mA) must 
look like a downward bell, with maximum at ruA = and asymptotically approaching zero 
for itia/H 3> ^. This is confirmed by the numerical computation shown in figure 13, where 
we plot AN for fixed £, as function of the ratio mA/{^H^). 

7.3 Local non-Gaussianity 

In this subsection we use the 6N formalism [32, 33] to estimate the conversion of pertur- 
bations of the Higgs field h into curvature perturbations. We find that there is a regime of 
parameters in which observable local non-Gaussianity can arise. This regime requires the 
Higgs field h to have a small mass compared to H, which technically, at the level of the 
present construction, is not a natural assumption. 

The 6N formalism [32,33] provides a relation between curvature perturbations TZ and 
the different amount of expansion that different regions of the universe experience. More 
specifically, at scales much larger than the horizon when gradient terms are negligible 
(separate universes approximation), curvature perturbations TZ evaluated at some time t 
on a constant energy-density hyper-surface are equal to the perturbations in the number of 
e-foldings 5N between that same hypersurface and an initial flat hypersurface at ij. When 
the number of e-foldings is a function of the value of a set of fields at time t, it can be 
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Figure 13: The plots show the additional e-foldings AiV due to strong-backreaction of {E ■ B) as 
function of the ratio ■mA/{£,H^,) for fixed ^ = 2.4 on the left and ^ = .3 on the right. The plots are 
produced using a quadratic inflaton potential and H^, = 5 x 10^^ is the Hubble parameter at some 
pivot time during inflation. This downward-bell shape can be easily understood. For niA = the 
tachyonic enhancement is maximized and so is A7V. For itia/H ^ ^ the tachyonic enhancement is 
completely quenched and AiV = 0. The plot is symmetric around zero since {E ■ B) is a function 
of m^4 . 



expanded in a Taylor series. One finds 



7^(i) ~ Yl ^,1^^'iti) + IY1 Njj5ip\ti)^\ti) + O (5(/.3) 



(7.20) 



ij 



where Nj = dN/d(p^{ti) and similarly for Njj. In the model considered in this section 
(7.1) we have two scalar fields (p and h and the vector field A. When the backreaction of 
{E ■ B) is negligible at the time cosmological perturbations k < lMpc~^ leave the horizon, 
the perturbations in the gauge field do not affect curvature perturbations. This is the 
result of the fact that the integral in (7.17), which determines the size of {E ■ B), is almost 
entirely supported around horizon crossing x ~ 0{1). Perturbations in A of cosmological 
size k < lMpc~^ affect {E-B) only when this is a negligible correction to the homogeneous 
equations of motion. Later on, closer to the end inflation, {E ■ B) starts to dominate the 
homogeneous dynamics, but the modes that cross the horizon at that moment correspond 
to scales tens of orders of magnitude smaller than those relevant for cosmology. 

Things are different for the scalar fields if they are lighter than H. The inflaton must 
be light because the shift symmetry is only weakly broken by the slow-roll flat inflaton 
potential. The mass of \h\ around its minimum (or W{h) in (7.1)) on the other hand is a 
free parameter. Not surprisingly interesting phenomenology arises only when mi^i <^ H. 
Henceforth we will only consider this regime. The mass of h is not protected by a symmetry 
and in that sense a small mass is unnatural. Let us estimate the various derivatives 
appearing in (7.20) starting with the inflaton. N^^ ~ {2e)~^''^ ~ 0(10), and we can neglect 
N^^fj) since single field slow-roll inflation produces negligible non-Gaussianity [34,35]. The 
cross derivative N^h<f> is smaller than N^hh and therefore we neglect it as well. Moving 
on to derivatives with respect to h, we use the results of the previous subsection and 
dh = \piedmA- ^^"^ "^-4 ~ ^^ o'^^ "^^^ approximate ^N{mA) as a downward parabola. 
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One hence find the order of magnitude estimate 

N, ^ V^ed^^^N ~ e^^^ , (7.21) 

where AA'^max = AN{mA = 0) is the largest amount of extra e-foldings for fixed ^ and 
fixed potential (see e.g. figure 13). For the second derivative, a similar estimate gives 

1 
Combining these results we find the order of magnitude estimate 



\hh ^ 2e^dl^N ~ e^AiV^^ax 7777^2 • (^-22) 



^ '5</' / X 1 /9 ^h AA^max 

^~ T7-(2e)" ^ +— e- 



where for massless fields we can use the standard result 



ruA e 6h 



(7.23) 



hV 



{5h') c^i27r f5HK)^^j . (7.24) 

A useful formula for local non-Gaussianity in the 5N approach [33] is given by 

Imposing the COBE normalization A^(/cp) = 2.4 x 10~^, one obtains the following estimate 
for the late time local non-Gaussianity produced by the the conversion of 6h into TZ 

eiO-3 I [^Hj 



fNl-lO^r^] f^y. (7.26) 



One has to check that the contribution of 6h to TZ does not exceed the COBE normalization, 
this is guaranteed if 

#e2AiV^ax V + ^J^^- ^^-^^^ 

It is now easy to check that observable^^ local non-Gaussianity can be generated in this 
model, for example ^ ~ AiVmax ~ 1, mA/{CH) ~ 0.1 and e = 2 x 10"^ gives /]^£ ~ 40. 

floe 
INL 



Here we would like to emphasize that /)S? can range from very large to very small number 



and the volume of parameter space in which /]^2 is detectable by say Planck but not yet 
ruled out by WMAP is a small fraction of the total volume. 
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^'^WMAP? [30] imposes the bound -10 < f^l < 74 at 95% CL, and it is expected Planck will be sensitive 
to Afj^l ~ 5. 
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A. On the scale dependence of the power spectrum 

Following [29] we will now re-derive the inverse decay corrections to the power spectrum 
(1.3) with particular attention to the slow-roll deviations from scale invariance. 

Let us start considering the simple case of single field inflation, without any gauge 
field. One can expand the action around a homogenous slow-roll background to quadratic 
order in the perturbations. One can fix a flat gauge in which^^ gij = a^dij. After solving 
the constraints coming from the Einstein equations, e.g. as in [34], one flnds the equation 
of motion (see e.g. [6]) 






V^ + 2aH— + a^ I V" + 167rG 

OT \ 



|r.«,G|, 



0. 



(A.l) 



We have written out the Newton constant explicitly to show which terms arise from the 
interaction with gravity through the constraint equations. In order to keep all slow- roll 
corrections into account, in (A.l) we should use^^ 

1 



aH 



(1 



We also find it useful to use the variable x 



= — fcr and quantize according to^^ 



Ak-x 



bkbr + h..c. 



hA ={27rf6(k-k' 



Then (A.l) can be rewritten as 
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+ 1 



2(1 



d 



+ a^H'^0{e^,7]^) 



6(1) = Q, 



(A.2) 

(A.3) 
(A.4) 

(A.5) 



dx^ ' X dx 

where 0(e^,, 7/=,,) = 3 (3e,, — r/^,/2) is some linear combination of e* and 77*, which as we will 
see is not relevant to compute deviations from scale invariance at leading order in the slow- 
roll parameters. A positive frequency solution of (A.5) is proportional to x^''^'^'"'' HJ) (x) 
where z/ = 3/2 -|- 0(e*, ??*) and the exact expression for the slow-roll correction is again not 
important for our purposes. Imposing the Bunch-Davies vacuum at some initial moment 
X = Xi ^ 1 one can fix the (second) integration constant C^ 

'2 ^H{Ti) 

vr 



CkX' 



3/2H 



'Hl^\x) 



Cfcx}+^* 



- XjC 



(A.6) 



/2fc3 

up to an irrelevant phase. Note here that H^^> is the Hankel function, not to be confused 
with the Hubble parameter H, appearing on the right. The time dependence of the Hubble 
paramater H at linear order in the slow-roll parameters is 



Hiri) = H(t2) 



•> 



(A.7) 



^®We neglect vector and tensor perturbations. 

^^Notice that in the left hand side of (A.2) both H and a are functions of r, as can be seen by taking the 
derivative of both sides with respect to r. Unless a * is present, all quantities are functions of time. 
^*Note that we have different conventions with respect to [29] in both the factors of 2-k and of a{t). 
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so that 



Ck 



7rH{- 
2 ] 



Xi/k) 



/2F 



(Ai 



is independent of the choice of x, at the order we are working. On the other hand, the scale 
dependence is non-trivial: S(j)x=i oc C^ ex A;^^'^"'^*. This constitutes one of the two sources 
of deviation from scale invariance. Our final goal is to compute the (gauge invariant) 
variable [24] 



n = -'^ 



H 



(A.9) 



with gij = a 6ij (1 — 2'!'). Outside of the horizon TZ has a constant and a decaying mode. 
To isolate the constant mode we first take the small x limit of 66 



TT 



-2''T{u)CkX 



3/2+6^-1/ 



(A.IO) 



Now we have to convert S(j) into TZ using 6(j) = TZy/e, where e depends slowly on time 
and gives the second and last contribution to deviations from scale invariance. For the 
purpose of computing deviations from scale invariance, the conversion can be done at any 
X, as long as this is the same for every mode so that we do not introduce any spurious 
scale dependence. This makes is clear that slow-roll corrections to i' in (A.IO) only affect 
the amplitude, but not the scale dependence of the result since they do not affect any 
/c-dependent quantity. In practice, the overall amplitude is captured most accurately if the 
conversion from 5(j) to TZ is performed around horizon crossing for each mode, i.e. x = 1. 
Also, since C^ does not depend on Xi we can simply rewrite it evaluating H at horizon 
crossing. The final result is then 



\TZkix 



m 



H^ 



4efc3 4e^/i;3 



-2e,-»y, 



(A.ll) 



where — 2e* and r/* come from the scale dependence of H^ and 1/e, respectively. 

Summarizing, in perfect de Sitter H is constant, every mode 5cj) goes through the same 
history on the same background and the resulting spectrum for 6(j) is scale invariant. In a 
quasi de Sitter expansion as relevant for inflation, H and its derivatives vary slowly with 
time. As different modes go through the same history, they experience a slightly different 
background, due to the decrease in H as inflation proceeds, and a slightly different factor 
■\/2e for the conversion into TZ. As long as we keep the momentum dependence of these 
two factors into account, we can ignore those slow-roll corrections, e.g. in the last term of 
(A. 5), which lead to only small corrections to the overall amplitude. 

Let us now turn to the problem at hand 



^-V' + 2aH^ + a'V" 
OT'^ or 



)« 



E-B- {E-B] 



(A.12) 
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or 



^' +k^ + 2aH-^+a^V" 



dr^ 



dr 



M^) > 



M-) 



,a 



f 



a-- I d-'xe-"'-''E-B{x), 



(A.13) 
(A.14) 



Since we have already discussed the homogeneous part of the equation we know where 
the scale dependence will arise in solving the inhomogeneous part with the Green function 
method. Let us compute {TV^ Ti}^ ) using 



^^'•M = ^r dr'G,{r,r')Mr'), 



'^ ^ ' (pik) J-oo 
where the Green function is given by 

Gfc(r,r') = ie(T - r') [5Mr)SMr') - h.c] 



(A.15) 



(A.16) 



As discussed previously, k dependence in 6(p comes just from C^. Using this information 
and repeating the derivation of the power spectrum presented in [29], one comes to the 
final result 



(R}f-T^-) = {2.f5\k + k')hm))e'<^'^''-^, r^"^^^' 



1 



(27r)2 2e,(fe) 



(A.17) 



where 



/2(e) 



Sit 



(fq 



1 + 



q^ — q- k 

q\k-q\ 



q\k 



y/q + y\k-^ 



xX' 



seu^+vi^ 



with 



X{x) 



dxx^/'^Re hP 



— ki 



-Zs/x 



(A.18) 



(A.19) 



In (A.17), e^'^^^ ' has been taken out of the integral appearing in the definition of J2- This 
can be done because, as explained in [29], the integral is supported around \q\ ~ |A;|. We 
can then rewrite the whole power spectrum, including both vacuum and inverse decay 
contributions as 



{n{k)n{k')) = {27rf 6 ik + k 



r>\ 27r^ 



A;3 



H' 



1 + 



h! 



27r|(/., 



27r| 



2 / ^ \ "s-l 



ris-l 



/2(e(fc))e^-«W 



with 



n,c - 1 



-2e* - r/^. ~ 6ey + 2r]v , 



m = e* 



l + |lo, 



+ 0{e' 



(A.20) 
(A.21) 

(A.22) 
(A.23) 
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Here the star in H^, and in other scale dependent quantities indicates that these quantities 
are evaluated at horizon crossing of a pivot scale, which, for concreteness, we have chosen 
to be h = 0.002Mpc-^ 

We have one technical comment on computing I numerically. As explained in [11], we 
are only interested in super horizon modes, i.e. modes with —kr <C 1. Therefore, we can 
safely put the lower limit in the integral to 0. As we have reviewed, the slow-roll corrections 
to z^ = 3/2 + 0{e, rj) do not affect the scale dependence of the correlation function, but just 
its amplitude. We can hence neglect them. The integral X can then be approximated to 
be 



Z{z) 



dx (sin X — X cos x) e 



-Zsfx 



(A.24) 



2 + v![^I^g |ei(^^— )/4 (^2 _ ^Q.^ g^f^ 



,i7r/4: 



+ 



+e^ 



-i(22-7r)/4 / 2 



+ lOi) Erfc 



-iir/A' 



where the second line is a semi-analytical expression useful in numerical computations. 
Erfc(x) = 1 — Erf (x) is the complementary error function. 

B. Technical details on the bispectrum constraint 

The three-point function is given by [29] 



<n%(-)) = /n 



air) "''•''■ 



(11 '^.-.(-^))- 



(B.l) 



It was shown in [11] that this results in 



(!!%.(-)) 
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HHh) 



^ (f){ki)k. 



6{Y,h 



S 



Qi 



(27r)£ 



ll[ei{-iy+'q^).e{em 



i+i) 



X / dTi[kiTi cos{kiTi) - sin(fciri)] x Aln, \qi\,\qi+i\ 



(B.2) 



For convenience of notation we defined q2 = ki — qi, q^ = k^ + qi, and q^ = iji. In addition 
we defined a function B with the properties 0(1) = 1, 0(2) = 0(3) = —1 while 



Air, [a], 16]) = \a\A,r{T, \b\)A{T, \a\) + \b\A,r{T, \a\)AiT, \b\). 



(B.3) 



In principle, A = A^ is given by the Whittaker solution (7.13). As was first argued in [11] 
interesting physical effects occur in the interval 1/8^* ^ —kr < 2^^. The solution for A 
can then be approximated to be 



AiT,k)- 



1 



-kr 



2k V 2^ 



1/4 



,7re-2V-25feT 



(B.4) 
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Using this approximation we can compute the integrals over conformal time in B.2, keeping 
track of all momentum dependence in ^ 

3 .0 p6^€((nkll)^/') 



JJ / dTi[kiTi cos{kiTi) - sin(fciri)] x A[Ti, \qi\, \qi+i\ 

■ J —oo 



1 1/2 



(B.5) 

In section 4 we argued that the ^ in the exponential should dominate the scale dependence 
of the bispectrum. As argued in [29], the support of the qi integral is for gi ~ fci ~ ^2 ~ ^3i 
which explains why the resulting shape is close to equilateral. Hence we approximate ^ in 
the exponential by 



6vrC((n m)'^') ^ 6<((n l^'D'^') ^ <* 



6 + ?7* log 



k^ 



(B.6) 



We have checked explicitly that using the arithmetic instead of the geometric mean does 
not affect our final results. This is to be expected since we compare only shapes that peak 
in equilateral configurations, where the two means differ only very little. Then we can 
approximate the bispectrum as 



fid 

Jnl 



^id 



A^,sr(^*)e'"«*/3(e*,l,l 



7?.V"'* 



fc2 M ' 



Efc? A«..t.f 



-(2.)^Ai(..)j^^, AK..U) 



n 



n^H: 



7r^*?7*+ris-l 



(B.7) 
(B.8) 



Defining qi/ki = Qi and Xj = ki/ki we can write /3(C*) ^2/^!; ^3/^1) as 

3 



/3(C*,a;2,X3) 



1 



j d^Qitim" [m" +u+i\"' 



37ra;23;3(l + x\ + x|) 
X (2y2e7^(|Q,|V2 + |Q,+i|i/2)) [e-((-l)*+iQ,) . e{Q[i)Qi+i)] . (B.9) 
/3 can be computed numerically by setting [11] 

ki/ki = ki = (1,0,0), (B.IO) 

and 

h/h = -^{l-xl + xl, ^/-{l -X2 + X3)(l + X2 - X3)(l + X2 + X3), o) , (B.ll) 

and noting that to a generic vector k = /c(sin^cos(/), sin0sin(^,cos0) there exists a corre- 
sponding polarization vector 

1 



e{k) = —^ (cos 9 cos <f> — i sin (p, cos 9 sin (f) + i cos (f), — sin 9) . 
v2 



(B.12) 



Using expressions (B.8) and (B.9) and this parametrization we computed the fudge factor 
of (4.11). 
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C. Computation with the gauge fields 

Here is a list of useful formulae 

F^o = a-2(i, + a.^O), (C.l) 

F'^ = a-^Fi, , (C.2) 

Ai = a^i (tf + Ob) - diA"^ , (C.3) 

dtia^F'^) = aA, + aHAi + dtiad^A^) , (C.4) 

d^{a''F>'') = -[aA^ - ^ + aHAi + ^t{a^^A'^)] , (C.5) 



d^{a^<PtF'^yf) = yVxA, (C.6) 

d^ia'^F.^'/f) = jVxAh- jiV^) X lb , (C.7) 

tf = aHAi + aij + at(a5i^° - jV x Ab - y V x 1) . (C.8) 

Using the Coulomb gauge V • ^ = and 

£ijk£ = (^j Ok - °j^k ■• (^-9) 



one finds 
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